IS

Wei, Chih-Ping

Topic Weight    Topic Terms
0.461           office document documents retrieval automation word concept clustering text based automated created individual functions major
0.174           applications application reasoning approach cases support hypertext case-based prototype problems consistency developed benchmarking described efficient
0.141           information management data processing systems corporate article communications organization control distributed department capacity departments major
0.106           qualitative methods quantitative approaches approach selection analysis criteria used mixed methodological aspects recent selecting combining
0.104           database language query databases natural data queries relational processing paper using request views access use
0.101           personalization content personalized willingness web pay online likelihood information consumers cues customers consumer services elaboration

Coauthors (with number of co-authorships):
Chiang, Roger H.L. (1); Hu, Paul Jen-Hwa (1); HUANG, CHUN-NENG (1); TAI, CHIA-HUNG (1); Wu, Chia-Chen (1); YANG, CHIN-SHENG (1)
Keywords (with counts):
document clustering (1); fuzzy clustering (1); hierarchical agglomerative clustering (HAC) (1); information retrieval (1); personalized document clustering (1); query expansion (1); supervised document clustering (1); text mining (1); topic-based query expansion (1); word mismatch (1)

Articles (2)

Managing Word Mismatch Problems in Information Retrieval: A Topic-Based Query Expansion Approach. (Journal of Management Information Systems, 2007)
Abstract:
    Word mismatch represents a fundamental information retrieval challenge that has become increasingly important as electronic document repositories (e.g., Web resources, digital libraries) grow in number and sheer volume. In general, word mismatch refers to the phenomenon in which a concept is described by different terms in user queries and in source documents. Query expansion represents a promising avenue to address such problems. Previous research predominantly approaches query expansion on the basis of global or local analysis. However, these approaches emphasize a global perspective rather than taking a topic-specific view of term associations. As a consequence, their effectiveness can be severely constrained when the document corpus spans a diverse set of topics. In this study, we propose a topic-based approach for query expansion and develop and empirically evaluate two novel methods--namely, nonfuzzy and fuzzy topic-based query expansion--to address word mismatch problems. According to our evaluation results, the proposed topic-based approach is more effective than a benchmark global analysis method, particularly when user queries consist of multiple query terms.
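The contrast the abstract draws between global and topic-specific term associations can be illustrated with a toy sketch. This is not the authors' nonfuzzy or fuzzy method, whose details the abstract does not give. The corpus, the topic labels, the co-occurrence scoring, and the function names `expand_global` and `expand_topic_based` are all hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical toy corpus, already grouped by topic (an assumption;
# the abstract does not say how topics are obtained).
CORPUS = {
    "finance": ["bank loan interest", "bank credit loan"],
    "rivers": ["river bank erosion", "river water flow", "river bank water"],
}

def cooccurring_terms(docs, query_terms, k=2):
    """Rank non-query terms by the number of docs they share with a query term."""
    query = set(query_terms)
    counts = Counter()
    for doc in docs:
        words = set(doc.split())
        if words & query:
            counts.update(words - query)
    # Sort by count (descending), then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]

def expand_global(corpus, query_terms, k=2):
    """Expand using term associations computed over the whole corpus."""
    all_docs = [doc for docs in corpus.values() for doc in docs]
    return list(query_terms) + cooccurring_terms(all_docs, query_terms, k)

def expand_topic_based(corpus, query_terms, k=2):
    """Expand using associations from the single best-matching topic only."""
    query = set(query_terms)

    def hits(topic):
        return sum(len(set(doc.split()) & query) for doc in corpus[topic])

    best_topic = max(sorted(corpus), key=hits)
    return list(query_terms) + cooccurring_terms(corpus[best_topic], query_terms, k)

print(expand_global(CORPUS, ["bank", "loan"]))
print(expand_topic_based(CORPUS, ["bank", "loan"]))
```

In this toy corpus the global expansion of the query "bank loan" picks up "river", because "bank" also co-occurs with river-related terms, while the topic-based expansion stays within the finance documents.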
Accommodating Individual Preferences in the Categorization of Documents: A Personalized Clustering Approach. (Journal of Management Information Systems, 2006)
Abstract:
    As electronic commerce and knowledge economy environments proliferate, both individuals and organizations increasingly generate and consume large amounts of online information, typically available as textual documents. To manage this ever-increasing volume of documents, individuals and organizations frequently organize their documents into categories that facilitate document management and subsequent access and browsing. Document clustering is an intentional act that should reflect individual preferences with regard to the semantic coherency and relevant categorization of documents. Hence, effective document clustering must consider individual preferences and needs to support personalization in document categorization. In this paper, we present an automatic document-clustering approach that incorporates an individual's partial clustering as preferential information. Combining two document representation methods, feature refinement and feature weighting, with two clustering methods, precluster-based hierarchical agglomerative clustering (HAC) and atomic-based HAC, we establish four personalized document-clustering techniques. Using a traditional content-based document-clustering technique as a performance benchmark, we find that the proposed personalized document-clustering techniques improve clustering effectiveness, as measured by cluster precision and cluster recall.
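The abstract names cluster precision and cluster recall but does not define them. One common pairwise formulation, assumed here rather than taken from the paper, counts the document pairs ("associations") that a clustering places together and compares them with the pairs in a target categorization. The function names and the example data below are hypothetical.

```python
from itertools import combinations

def associations(clusters):
    """Return the set of unordered document pairs that share a cluster."""
    pairs = set()
    for cluster in clusters:
        pairs.update(frozenset(pair) for pair in combinations(sorted(cluster), 2))
    return pairs

def cluster_precision_recall(generated, target):
    """Pairwise precision and recall of generated clusters against a target.

    Assumed definitions (not confirmed by the abstract):
      precision = correct pairs / pairs in the generated clusters
      recall    = correct pairs / pairs in the target categories
    """
    generated_pairs = associations(generated)
    target_pairs = associations(target)
    correct = generated_pairs & target_pairs
    precision = len(correct) / len(generated_pairs) if generated_pairs else 0.0
    recall = len(correct) / len(target_pairs) if target_pairs else 0.0
    return precision, recall

# Hypothetical example: five documents, a user's target categories,
# and the clusters produced by some clustering technique.
target = [["a", "b", "c"], ["d", "e"]]
generated = [["a", "b"], ["c", "d", "e"]]
print(cluster_precision_recall(generated, target))
```

Here the generated clusters contain four pairs, of which two (a-b and d-e) also appear in the target, so both measures come out at 0.5 under the assumed definitions.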